AITopics | image colorization

Collaborating Authors

image colorization

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors

Neural Information Processing SystemsFeb-17-2026, 23:01:46 GMT

With the proposed novel sampling strategy, our model achieves instance-aware colorization in diverse and complex scenarios.

colorization, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
Asia > China > Hong Kong (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Prompt-based Consistent Video Colorization

Dani, Silvia, Uricchio, Tiberio, Seidenari, Lorenzo

arXiv.org Artificial IntelligenceDec-1-2025

Existing video colorization methods struggle with temporal flickering or demand extensive manual input. We propose a novel approach automating high-fidelity video colorization using rich semantic guidance derived from language and segmentation. We employ a language-conditioned diffusion model to colorize grayscale frames. Guidance is provided via automatically generated object masks and textual prompts; our primary automatic method uses a generic prompt, achieving state-of-the-art results without specific color input. Temporal stability is achieved by warping color information from previous frames using optical flow (RAFT); a correction step detects and fixes inconsistencies introduced by warping. Evaluations on standard benchmarks (DAVIS30, VIDEVO20) show our method achieves state-of-the-art performance in colorization accuracy (PSNR) and visual realism (Colorfulness, CDC), demonstrating the efficacy of automated prompt-based guidance for consistent video colorization.

colorization, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2511.2233

Country:

Europe > Italy > Tuscany > Pisa Province > Pisa (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.94)

Add feedback

Supplmentary Material: L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors

Neural Information Processing SystemsOct-9-2025, 11:46:19 GMT

To demonstrate the effectiveness of our proposed luminance-guided image compression, semantic-aligned latent representation, and instance-aware sampling strategy (details in Sec. We demonstrate our generalization capability by showing more colorization results on legacy black-and-white photos in Figure 1, where results are presented sequentially from left to right using descriptions at the complete, partial, and scarce levels. Learning to color from language.

artificial intelligence, colorization result, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.05)
Asia > China > Beijing > Beijing (0.05)

Industry: Media > Photography (0.36)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (0.55)
Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

f3bfbd65743e60c685a3845bd61ce15f-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 11:46:15 GMT

colorization, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
Asia > China > Hong Kong (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Understanding SOAP from the Perspective of Gradient Whitening

Lu, Yanqing, Wang, Letao, Liu, Jinbo

arXiv.org Artificial IntelligenceSep-30-2025

Shampoo with Adam in the Preconditioner's eigenbasis (SOAP) has recently emerged as a promising optimization algorithm for neural network training, achieving superior training efficiency over both Adam and Shampoo in language modeling tasks. In this work, we analyze Adam, Shampoo, and SOAP from the perspective of gradient whitening, interpreting their preconditioners as approximations to the whitening matrix, which captures second-order curvature information. We further establish a theoretical equivalence between idealized versions of SOAP and Shampoo under the Kronecker product assumption. To empirically evaluate these insights, we reproduce the language modeling experiments using nanoGPT and grayscale image colorization. Our results show that SOAP exhibits similar convergence rate as Shampoo, and no significant advantage over both Adam and Shampoo in the final loss achieved, which aligns with their equivalence in theory.

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2509.22938

Country: North America > United States > California (0.14)

Genre: Research Report > New Finding (0.54)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Add feedback

Instance-aware Image Colorization with Controllable Textual Descriptions and Segmentation Masks

An, Yanru, Gui, Ling, Cai, Chunlei, Ye, Tianxiao, Yao, JIangchao, Zhai, Guangtao, Hu, Qiang, Zhang, Xiaoyun

arXiv.org Artificial IntelligenceSep-26-2025

Recently, the application of deep learning in image colorization has received widespread attention. The maturation of diffusion models has further advanced the development of image colorization models. However, current mainstream image colorization models still face issues such as color bleeding and color binding errors, and cannot colorize images at the instance level. In this paper, we propose a diffusion-based colorization method MT-Color to achieve precise instance-aware colorization with use-provided guidance. To tackle color bleeding issue, we design a pixel-level mask attention mechanism that integrates latent features and conditional gray image features through cross-attention. We use segmentation masks to construct cross-attention masks, preventing pixel information from exchanging between different instances. We also introduce an instance mask and text guidance module that extracts instance masks and text representations of each instance, which are then fused with latent features through self-attention, utilizing instance masks to form self-attention masks to prevent instance texts from guiding the colorization of other areas, thus mitigating color binding errors. Furthermore, we apply a multi-instance sampling strategy, which involves sampling each instance region separately and then fusing the results. Additionally, we have created a specialized dataset for instance-level colorization tasks, GPT-color, by leveraging large visual language models on existing image datasets. Qualitative and quantitative experiments show that our model and dataset outperform previous methods and datasets.

colorization, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2505.08705

Country:

Asia > China > Shanghai > Shanghai (0.04)
Europe > Switzerland (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)

Add feedback

MTSIC: Multi-stage Transformer-based GAN for Spectral Infrared Image Colorization

Liu, Tingting, Liu, Yuan, Tang, Jinhui, Yuan, Liyin, Liu, Chengyu, Li, Chunlai, Sui, Xiubao, Chen, Qian

arXiv.org Artificial IntelligenceJun-24-2025

--Thermal infrared (TIR) images, acquired through thermal radiation imaging, are unaffected by variations in lighting conditions and atmospheric haze. However, TIR images inherently lack color and texture information, limiting downstream tasks and potentially causing visual fatigue. Existing colorization methods primarily rely on single-band images with limited spectral information and insufficient feature extraction capabilities, which often result in image distortion and semantic ambiguity. In contrast, multiband infrared imagery provides richer spectral data, facilitating the preservation of finer details and enhancing semantic accuracy. In this paper, we propose a generative adversarial network (GAN)-based framework designed to integrate spectral information to enhance the colorization of infrared images. The framework employs a multi-stage spectral self-attention Transformer network (MTSIC) as the generator . Each spectral feature is treated as a token for self-attention computation, and a multi-head self-attention mechanism forms a spatial-spectral attention residual block (SARB), achieving multi-band feature mapping and reducing semantic confusion. Multiple SARB units are integrated into a Transformer-based single-stage network (STformer), which uses a U-shaped architecture to extract contextual information, combined with multi-scale wavelet blocks (MSWB) to align semantic information in the spatial-frequency dual domain. Multiple STformer modules are cascaded to form MTSIC, progressively optimizing the reconstruction quality. Experimental results demonstrate that the proposed method significantly outperforms traditional techniques and effectively enhances the visual quality of infrared images. Unlike visible-light images, TIR images are typically grayscale, lacking both color and fine texture details [2]. The human visual system can discern thousands of hues and intensities, but only around two dozen shades of gray [3]. Prolonged viewing of grayscale images can also lead to visual fatigue, further highlighting the necessity of colorization.

colorization, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2506.1754

Country:

Asia > China > Jiangsu Province > Nanjing (0.04)
Asia > China > Zhejiang Province (0.04)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Convolutional Deep Colorization for Image Compression: A Color Grid Based Approach

Tassin, Ian, Goebel, Kristen, Lasher, Brittany

arXiv.org Artificial IntelligenceFeb-7-2025

The search for image compression optimization techniques is a topic of constant interest both in and out of academic circles. One method that shows promise toward future improvements in this field is image colorization since image colorization algorithms can reduce the amount of color data that needs to be stored for an image. Our work focuses on optimizing a color grid based approach to fully-automated image color information retention with regard to convolutional colorization network architecture for the purposes of image compression. More generally, using a convolutional neural network for image re-colorization, we want to minimize the amount of color information that is stored while still being able to faithfully re-color images. Our results yielded a promising image compression ratio, while still allowing for successful image recolorization reaching high CSIM values.

artificial intelligence, colorization, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2502.05402

Country:

North America > United States > Oregon (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Genre: Research Report (0.84)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

Transforming Color: A Novel Image Colorization Method

Shafiq, Hamza, Lee, Bumshik

arXiv.org Artificial IntelligenceOct-7-2024

This paper introduces a novel method for image colorization that utilizes a color transformer and generative adversarial networks (GANs) to address the challenge of generating visually appealing colorized images. Conventional approaches often struggle with capturing long-range dependencies and producing realistic colorizations. The proposed method integrates a transformer architecture to capture global information and a GAN framework to improve visual quality. In this study, a color encoder that utilizes a random normal distribution to generate color features is applied. These features are then integrated with grayscale image features to enhance the overall representation of the images. Our method demonstrates superior performance compared with existing approaches by utilizing the capacity of the transformer, which can capture long-range dependencies and generate a realistic colorization of the GAN. Experimental results show that the proposed network significantly outperforms other state-of-the-art colorization techniques, highlighting its potential for image colorization. This research opens new possibilities for precise and visually compelling image colorization in domains such as digital restoration and historical image analysis.

architecture, color encoder, colorization, (12 more...)

arXiv.org Artificial Intelligence

doi: 10.3390/electronics13132511

2410.04799

Country:

Europe > Austria > Vienna (0.14)
North America > United States (0.04)
Europe > Switzerland (0.04)
Asia > South Korea > Gwangju > Gwangju (0.04)

Genre: Research Report > Promising Solution (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

Add feedback